ai test
Leveraging Generative AI for Enhancing Automated Assessment in Programming Education Contests
Dascalescu, Stefan, Dumitran, Adrian Marius, Vasiluta, Mihai Alexandru
Competitive programming contests play a crucial role in cultivating computational thinking and algorithmic skills among learners. However, generating comprehensive test cases to effectively assess programming solutions remains resource-intensive and challenging for educators. This paper introduces an innovative NLP-driven method leveraging generative AI (large language models) to automate the creation of high-quality test cases for competitive programming assessments. We extensively evaluated our approach on diverse datasets, including 25 years of Romanian Informatics Olympiad (OJI) data for 5th graders, recent competitions hosted on the Kilonova.ro platform, and the International Informatics Olympiad in Teams (IIOT). Our results demonstrate that AI-generated test cases substantially enhanced assessments, notably identifying previously undetected errors in 67% of the OJI 5th grade programming problems. These improvements underscore the complementary educational value of our technique in formative assessment contexts. By openly sharing our prompts, translated datasets, and methodologies, we offer practical NLP-based tools that educators and contest organizers can readily integrate to enhance assessment quality, reduce workload, and deepen insights into learner performance.
- Europe > Romania > Sud-Vest Oltenia Development Region (0.05)
- Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- Africa > Middle East > Morocco (0.04)
- Education > Assessment & Standards (1.00)
- Education > Educational Setting > K-12 Education (0.93)
The next question after Turing's question: Introducing the Grow-AI test
This study aims to extend the framework for assessing artificial intelligence, called GROW-AI (Growth and Realization of Autonomous Wisdom), designed to answer the question "Can machines grow up?" -- a natural successor to the Turing Test. The methodology applied is based on a system of six primary criteria (C1-C6), each assessed through a specific "game", divided into four arenas that explore both the human dimension and its transposition into AI. All decisions and actions of the entity are recorded in a standardized AI Journal, the primary source for calculating composite scores. The assessment uses the prior expert method to establish initial weights, and the global score -- Grow Up Index -- is calculated as the arithmetic mean of the six scores, with interpretation on maturity thresholds. The results show that the methodology allows for a coherent and comparable assessment of the level of "growth" of AI entities, regardless of their type (robots, software agents, LLMs). The multi-game structure highlights strengths and vulnerable areas, and the use of a unified journal guarantees traceability and replicability in the evaluation. The originality of the work lies in the conceptual transposition of the process of "growing" from the human world to that of artificial intelligence, in an integrated testing format that combines perspectives from psychology, robotics, computer science, and ethics. Through this approach, GROW-AI not only measures performance but also captures the evolutionary path of an AI entity towards maturity.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (17 more...)
- Health & Medicine > Therapeutic Area > Neurology (0.69)
- Energy (0.68)
New AI test can predict which men will benefit from prostate cancer drug
Doctors have developed an artificial intelligence tool that can predict which men with prostate cancer will benefit from a drug that halves the risk of dying. Abiraterone has been described as a "gamechanger" treatment for the disease, which is the most common form of cancer in men in more than 100 countries. It has already helped hundreds of thousands with advanced prostate cancer to live longer. But some countries, including England, have stopped short of offering the "spectacular" drug more widely to men whose disease has not spread. Now a team from the US, UK and Switzerland have built an AI test that shows which men would most likely benefit from abiraterone. The "exciting" breakthrough will enable healthcare systems to roll out the drug to more men, and spare others unnecessary treatment.
- Europe > United Kingdom > England (0.28)
- Europe > Switzerland (0.25)
- North America > United States > Illinois > Cook County > Chicago (0.05)
- (3 more...)
Multi-modal AI for comprehensive breast cancer prognostication
Witowski, Jan, Zeng, Ken, Cappadona, Joseph, Elayoubi, Jailan, Chiru, Elena Diana, Chan, Nancy, Kang, Young-Joon, Howard, Frederick, Ostrovnaya, Irina, Fernandez-Granda, Carlos, Schnabel, Freya, Ozerdem, Ugur, Liu, Kangning, Steinsnyder, Zoe, Thakore, Nitya, Sadic, Mohammad, Yeung, Frank, Liu, Elisa, Hill, Theodore, Swett, Benjamin, Rigau, Danielle, Clayburn, Andrew, Speirs, Valerie, Vetter, Marcus, Sojak, Lina, Soysal, Simone Muenst, Baumhoer, Daniel, Choucair, Khalil, Zong, Yu, Daoud, Lina, Saad, Anas, Abdulsattar, Waleed, Beydoun, Rafic, Pan, Jia-Wern, Makmur, Haslina, Teo, Soo-Hwang, Pak, Linda Ma, Angel, Victor, Zilenaite-Petrulaitiene, Dovile, Laurinavicius, Arvydas, Klar, Natalie, Piening, Brian D., Bifulco, Carlo, Jun, Sun-Young, Yi, Jae Pak, Lim, Su Hyun, Brufsky, Adam, Esteva, Francisco J., Pusztai, Lajos, LeCun, Yann, Geras, Krzysztof J.
Treatment selection in breast cancer is guided by molecular subtypes and clinical characteristics. Recurrence risk assessment plays a crucial role in personalizing treatment. Current methods, including genomic assays, have limited accuracy and clinical utility, leading to suboptimal decisions for many patients. We developed a test for breast cancer patient stratification based on digital pathology and clinical characteristics using novel AI methods. Specifically, we utilized a vision transformer-based pan-cancer foundation model trained with self-supervised learning to extract features from digitized H&E-stained slides. These features were integrated with clinical data to form a multi-modal AI test predicting cancer recurrence and death. The test was developed and evaluated using data from a total of 8,161 breast cancer patients across 15 cohorts originating from seven countries. Of these, 3,502 patients from five cohorts were used exclusively for evaluation, while the remaining patients were used for training. Our test accurately predicted our primary endpoint, disease-free interval, in the five external cohorts (C-index: 0.71 [0.68-0.75], HR: 3.63 [3.02-4.37, p<0.01]). In a direct comparison (N=858), the AI test was more accurate than Oncotype DX, the standard-of-care 21-gene assay, with a C-index of 0.67 [0.61-0.74] versus 0.61 [0.49-0.73], respectively. Additionally, the AI test added independent information to Oncotype DX in a multivariate analysis (HR: 3.11 [1.91-5.09, p<0.01)]). The test demonstrated robust accuracy across all major breast cancer subtypes, including TNBC (C-index: 0.71 [0.62-0.81], HR: 3.81 [2.35-6.17, p=0.02]), where no diagnostic tools are currently recommended by clinical guidelines. These results suggest that our AI test can improve accuracy, extend applicability to a wider range of patients, and enhance access to treatment selection tools.
- Europe > Switzerland > Basel-City > Basel (0.06)
- North America > United States > Illinois > Cook County > Chicago (0.05)
- Europe > United Kingdom > Wales (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Copy.Ai Test Writing: The Workforce is Rapidly Evolving with Technology
Have you ever created content using Copy.Ai? Copy.ai is an AI-powered copywriter that generates content creators to generate copy. Copy AI's content is always original and they have a few different service models including a free one. Copy.Ai can help you to create the following: Blog Ideas, Blog Intro's, Brand Mission, Brand Voice, Ad Copy, Call-To-Action, Cover Letters, Meta Descriptions, Event Copy, Name Generators, Hashtags, Social Media Posts, Freestyle and more!
AI Tests A 200-Year-Old Evolutionary Theory
This is the tiger longwing, formally known as Heliconius hecale. Artificial intelligence helped confirm one of the oldest ideas in evolution, but it also raised some new questions. The natural world is full of copycats. Pyralid moths use the same high-pitched warning noises as tiger moths to warn away predators, and harmless king snakes and venomous coral snakes have similar coloration. King snakes and pyralid moths are using something called Batesian mimicry, when a harmless species wards off predators by passing itself off as something fiercer or more toxic. That's not a delibrate ruse, of course; it's just that over time, the king snakes that survived long enough to reproduce tended to be the ones more easily mistaken for a coral snake, so they passed on that resemblance to their offspring.